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O ; ABSTRACT 

, We use the conditional luminosity function (CLF) and data from the 2-degree Field 

' Galaxy Redshift Survey (2dFGRS) to constrain the average relation between light 

\ and mass in a ACDM cosmology with r2,„ — 0.23 and cts = 0.74 (hereafter WMAP3 

y—^ . cosmology) . Reproducing the observed luminosity dependence of the galaxy two-point 

^ ' correlation function results in average mass-to-light ratios that are ~ 35 percent lower 

^ ! than in a ACDM cosmology with = 0.3 and erg = 0.9 (hereafter WMAPl cos- 

OO . mology). This removes an important problem with previous halo occupation models 

which had a tendency to predict cluster mass-to-light ratios that were too high. For 
the WMAP3 cosmology, our model yields average mass-to-light ratios, central galaxy 
luminosities, halo occupation numbers, satellite fractions, and luminosity-gap statis- 

\ tics, that are all in excellent agreement with those obtained from a 2dFGRS group 

. catalogue and from other independent studies. We also use our CLF model to com- 

i-C ' pute the probability distribution P(A/|Lccn), that a central galaxy of luminosity Leon 

resides in a halo of mass M. We find this distribution to be much broader than what 

Q ■ is typically assumed in halo occupation distribution (HOD) models, which has im- 

^—^ ' portant implications for the interpretation of satellite kinematics and galaxy-galaxy 

C/3 , lensing data. Finally, reproducing the luminosity dependence of the pairwise peculiar 

■ velocity dispersions (PVDs) in the 2dFGRS requires relatively low mass-to-light ratios 
^ I for clusters and a satellite fraction that decreases strongly with increasing luminosity. 

• i-H . This is only marginally consistent with the constraints obtained from the luminosity 

' dependence of the galaxy two-point correlation function. We argue that a cosmology 

H \ with parameters between those of the WMAPl and WMAP3 cosmologies is likely to 

. . 1 yield results with a higher level of consistency. 
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1 INTRODUCTION 

Using the observed distribution of galaxies to constrain the 
cosmology dependent matter distribution requires a detailed 
knowledge of galaxy bias. The development of the halo 
model (see Cooray & Sheth 2002 for a detailed review), in 
which the matter distribution is interpreted in terms of its 
halo building blocks, has provided us with a convenient way 
to quantify galaxy bias. The concept is that all galaxies re- 
side in dark matter haloes, and that these haloes themselves 
are a biased tracer of the dark matter mass distribution, the 
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so-called halo bias. As a consequence of the hierarchical na- 
ture of structure formation, more massive haloes are more 
strongly clustered (Cole & Kaiser 1989; Mo & White 1996, 
2002), and the halo bias is thus an increasing function of halo 
mass. Galaxy bias is then completely specified by a descrip- 
tion of how galaxies of different properties are distributed 
over dark matter haloes of different masses. 

In the standard halo occupation models, one tries to 
constrain the halo occupation distribution (HOD) P{N\M), 
which expresses the conditional probability that a halo of 
mass M contains N galaxies (of a specified type). The first 
moment of this distribution function, {N}m, expresses the 
average number of galaxies as function of halo mass. To- 
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gether with the halo bias bh[M), this completely specifies 
the galaxy bias on large scales. On smaller scales, however, 
additional information is required, such as the second mo- 
ment of the HOD, {N{N — 1))m, and information regarding 
the spatial distribution of galaxies within individual haloes 
(e.g., Seljak 2000; Scoccimarro et al. 2001; Berlind & Wein- 
berg 2002; Cooray & Sheth 2002; Kang et al. 2002; Berlind 
et al. 2003). Additional constraints on the higher moments 
of P{N\M) can be obtained from the n-point correlation 
functions with n > 3, (Takada & Jain 2003; Zheng 2004b). 

Numerous studies have shown that the observed two- 
point correlation function of galaxies tightly constrains the 
first and second moments of P(A'"|A/) (e.g. ,Jing, Mo & 
Borncr 1998; Peacock & Smith 2000; Scranton 2003; Maglio- 
chetti & Porciani 2003; Zchavi et al. 2004, 2005; Tinker et 
al. 2005; CoUister & Lahav 2005), and that the resulting con- 
straints are in good agreement with the occupation statistics 
of dark matter subhaloes (e.g, Kravtsov et al. 2004; van den 
Bosch, Tormcri & Gioccoli 2005c). The HOD modeling has 
also been applied to various galaxy populations at medium 
to high redshifts (e.g., Bullock, Wechsler & Somerville 2002; 
Zheng 2004a; Phleps et al. 2006; Lee et al. 2006) as well as 
to quasars (Porciani, Magliocchctti & Norborg 2004). Fur- 
thermore, Zheng & Weinberg (2005) have shown that cos- 
mology and galaxy bias are not degenerate. This means that 
one cannot arbitrarily modify the HOD and fit the observed 
clustering of galaxies for any cosmology; the HOD technique 
can simultaneously constrain both the galaxy bias and cos- 
mology (see also Zheng et al. 2002; van den Bosch, Mo & 
Yang 2003b; Abazajian et al. 2005). 

Since we know that galaxy bias is a function of galaxy 
properties, such as luminosity and color, a natural extension 
of the HOD modeling is to consider the occupation statistics 
as a function of galaxy properties. In Yang, Mo & van den 
Bosch (2003) , we took a first step in this direction and intro- 
duced the conditional luminosity function (hereafter CLF). 
The CLF, $(L|Af)dL, gives the average number of galaxies 
with luminosity L ± dL/2 that reside in a halo of mass M. 
Integrating the CLF over a given luminosity range [Li,L2], 
yields the average number of galaxies with Li < L < L2 
that reside in a halo of mass M: 

{N)m = [ ^ ^L\M)dL, (1) 
Jli 

Thus, the CLF contains the same information as the first 
moment of the halo occupation distribution P{N\M), but 
it does so for any luminosity interval. In addition, the CLF 
also specifies the total, average luminosity in a halo of mass 
M, 

poo 

{L)m= / ^L\M)LdL, (2) 

and thus specifies the average relation between light and 
mass in the Universe. As shown in Yang et al. (2003), the 
CLF is tightly constrained by the observed luminosity func- 
tion and the correlation lengths of the galaxy population as 
function of luminosity. In subsequent papers, the CLF has 
been used to study the occupation statistics as function of 
both luminosity and galaxy type at low rcdshift (van den 
Bosch, Yang & Mo 2003a; Yang et al. 2005b; Cooray 2005a, 
2006) and high redshift (Yan, Madgwick & White 2003; 
Cooray 2005b, 2006), to study the environment dependence 



of the galaxy luminosity function (Mo et al. 2004), to con- 
strain cosmological parameters (van den Bosch et al. 2003b; 
Tinker et al. 2005), to study the pairwise peculiar velocity 
dispersion of galaxies (Yang et al. 2004; Jing & Borner 2004; 
Li et al. 2006), to construct detailed mock galaxy redshift 
surveys (Yang et al. 2004; Yan, White & Coil 2004; van 
den Bosch et al. 2005a) and to investigate the luminosity 
and type dependence of the three-point correlation function 
(Wang et al. 2004). In addition, the CLF has proven a use- 
ful aid for interpreting the kinematics of satellite galaxies 
(van den Bosch et al. 2004), for constructing galaxy group 
catalogues (Yang et al. 2005a), and for furthering our un- 
derstanding of the galaxy luminosity function (Cooray & 
Milosavljevic 2005). 



Clearly, the CLF formalism is a powerful, statistical tool 
that has many applications. However, the occupation statis- 
tics inferred from the observed clustering data are cosmol- 
ogy dependent. Virtually all studies mentioned above have 
adopted a ACDM concordance cosmology with a matter 
density ftm = 0.3 and a Harrison-Zel'dovich, initial power 
spectrum with a normalization erg = 0.9. Recently, how- 
ever, the 3- year CMB data of the WMAP mission (Hinshaw 
et al. 2006; Page et al. 2006) has argued in favor of a flat 
ACDM cosmology with a significantly reduced ilm and as, 
and with a spectral index that is significantly smaller than 
unity (Spergel et al. 2006). This has a non-negligible impact 
on the halo mass function and the halo bias, both of which 
play an important role in the HOD modeling. The purpose 
of this paper is to revisit some of our conclusions based on 
the CLF formalism in the revised concordance cosmology. 
In particular, we want to investigate (i) whether we can still 
simultaneously fit the galaxy LF and the luminosity depen- 
dence of the galaxy correlation length, (ii) what this implies 
for the average relation between light and mass in the uni- 
verse, and (iii) whether the resulting CLF is consistent with 
a range of other observations, including various occupation 
statistics inferred from galaxy group catalogues, the lumi- 
nosity dependence of the pairwise peculiar velocity disper- 
sions, and the satellite fractions inferred from galaxy-galaxy 
lensing studies. Finally, we improve upon our previous anal- 
ysis by taking account of the scale dependence of the halo 
bias, and by properly modeling the observational data over 
the light-cone. 



This paper is organized as follows. In §2 we describe the 
theoretical framework of the CLF formalism, including a de- 
scription of modeling on the light-cone, and we discuss how 
small changes in cosmological parameters impact on various 
statistics of dark matter haloes. §3 presents the new CLF for 
the new WMAP concordance cosmology, and compares the 
mass-to-light ratios, satellite fractions, and the luminosity 
gap statistic predicted by that model to observational data. 
§4 presents a detailed analysis of the halo occupation num- 
bers predicted by our CLF model, which are compared to 
other HOD models. In §5 we use mock galaxy redshift sur- 
veys to study the pairwise velocity dispersions of galaxies 
and their luminosity dependence. We summarize our results 
in §6. 
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Figure 1. Several characteristics of darlc matter haloes at 2 = in the WMAPl and WMAP3 cosmologies. From left to right the panels 
show the halo mass functions, the halo bias as function of halo mass, and the halo concentration parameter Cvir, again as function of halo 
mass. Note that in the new WMAP3 cosmology, the abundance of massive haloes is strongly reduced. In addition, dark matter haloes 
are more strongly biased and less strongly concentrated. 



2 THEORETICAL FRAMEWORK 
2.1 Light-Cone Modeling 

As shown in Yang et al. (2003), the CLF can be tightly 
constrained by fitting it to the galaxy luminosity function, 
"1>(L), and to the galaxy correlation lengths as function of 
luminosity, ro{L). A complication arises from the fact that 
these observational data have been determined on a light- 
cone. In particular, each data point typically derives from a 
different light-cone specified by different redshift limits Zmin 
and Zmax- For example, in a flux-limited sample, one has 

that 2max — Zmax(,L/) . 

Since the halo mass function and the halo bias are both 
functions of redshift, one needs to properly integrate the 
model over the light-cone before comparing it to the data. 
This becomes more and more important when Zmax 3> Zmin. 
However, even for relatively nearby surveys, such as the 2dF- 
GRS considered in this paper, ignoring this light-cone mod- 
eling may result in errors of 5 to 15 percent. Typically the 
errors are larger for brighter samples, since they cover a 
larger (deeper) volume. 

Within the CLF formalism, the light-cone integrated 
LF is given by 



4.(L) 



dz 



dV_ 
d7 



dM${L\M,z)n{M,z) 



(3) 



where dV/dz is the comoving volume element per unit solid 
angle. In what follows we will assume that the CLF does 
not evolve with redshift, i.e., ${L\M,z) = ${L\M), which 
implies that we can write 



$(L) 



with 



$(L|M) 7ioff(M) dAf 



neff(M) = - 



dV 

dz — n(M,z) 
dz 



(4) 



(5) 



Note that this effective mass function is different for each 
data point, i.e., for each different (zmin, ^max). 

In the case of the clustering data we proceed as follows. 
At a given redshift, and on large scales, the two-point cor- 



relation function for dark matter haloes of mass AI can be 
deflned as 



Chh(r, M, z) = 6h(M, r, z) ^dm(r, z) 



(6) 



with bh{M, r, z) the scale-dependent halo bias, and ^dm('", z) 
the evolved, non-linear correlation function of the dark mat- 
ter at redshift 2. In what follows we assume (i) that the 
mass dependence of the halo bias is separable from the scale 
dependence, and (ii) that the scale dependent part is inde- 
pendent of redshift. We thus write that 



bUM,r,z)^bliM,z)ar) 



(7) 



Using large numerical simulations Tinker et al. (2005) have 
shown that assumption (i) is accurate and that, at z = 0, 

[l + 1.17edm(r,0)]i-^9 



[l + 0.69U.('-,0)]2"9 



(8) 



Our assumption (ii) implies that this equation also holds at 
z > 0. Although this is untested at present, and may well 
be incorrect, it is unlikely to have a significant effect on our 
results. After all, the correction for the scale dependence of 
the halo bias is only important for r < 3h~^ Mpc, which is 
smaller than the scales probed here. Indeed, if we completely 
ignore the scale dependence (i.e., if we set ( — 1), we obtain 
results that are only ~ 5 percent different, which is smaller 
than the measurements errors. 

The mass and redshift 'dependence of the halo bias can 
be written as 



bh{M,z) = H- 



y^5c{z) 



{w^Y + b{l- c){l- c/2) 



(9) 



with a = 0.707, 6 = 0.5, c = 0.6, and u = v{M,z) = 
5c{z)/a{M) (Sheth, Mo & Tormen 2001). Here S^z) is the 
critical overdensity required for spherical collapse at redshift 
z, and cj{M) is the linear theory rms mass fluctuation on the 
mass scale M. Tinker et al. (2005), using state-of-the-art nu- 
merical simulations, re-investigated the mass dependence of 
the halo bias, and found that equation (9) accurately fits 
their simulations, but with b = 0.35 and c = 0.80, which are 
the values we adopt throughout. 
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Table 1. Cosmological Parameters. 



ID 








h 






M* 


„dm 


o-O 


ai 


(12 


CO 


Cl 


C2 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 


(14) 


(15) 


WMAPl 


0.30 


0.70 


0.04 


0.70 


1.0 


0.90 


8.57 X 10^2 


5.10 


-0.056 


0.994 


-0.001 


11.07 


-2.49 


0.11 


WMAP3 


0.238 


0.762 


0.041 


0.734 


0.951 


0.744 


1.36 X 10^2 


4.27 


-0.078 


0.991 


-0.002 


7.92 


-1.70 


0.03 



Parameters of the two cosmological models discussed in this paper. Column (1) indicates the name by which we refer to these cosmologies 
in the paper. Columns (2)-(7) list the matter density, the energy density associated with the cosmological constant, the baryon density, 
the Hubble parameter, the spectral index, and the power-spectrum normalization. Column (8) lists the characteristic halo mass (in 
M0), defined as the mass scale at which the mass variance a{M) = 1.68. Column (9) lists the correlation length (in h^^ Mpc) of 
the evolved, non-linear matter field at z = 0. Finally, columns (10) to (15) list the fitting parameters that describe the relation between 
M and Mvir (equation [15]) and between halo concentration Cvir and M (equation [16]). 



The two-point correlation function for dark matter 
haloes of mass M in the volume of a hght-cone with Zmin < 
z < 0max is given by 

, , £'"^''dz%r,:'{M,z)bl(M,r,z)i^^{r,z) 

J.„„ (\z — n^{M,z) 

(cf. Hamana et al. 2001). Using that, on sufficiently large 
(linear) scales, ^dm(f, = i?^(2)^dm('', 0), with D{z) linear 
growth rate normalized to unity at 2 = 0, we obtain that 

^hh(r,M) = &teff(M)C(r)$dm(r,0) (11) 

with 

blMM) = rt.':.\'\ (12) 

Using this effective halo bias, we can write the two-point 
correlation function for galaxies of luminosity L, on large 

scales, as 



Cgg('r,L) =63,eft(L)C(r)5dm(r,0) 



(13) 



where the effective galaxy bias bg^es{L) is related to the 
eflfective halo bias, the effective halo mass function, and the 
CLF according to 



6g,eff(i) 



Jo 



^{L\M)bhMM)n^s{M)AM (14) 



2.2 Cosmology 

From the above it is clear that the computation of the galaxy 
correlation lengths, ro(i), defined by ^gg(ro,L) = 1, re- 
quires the halo mass function, n{M,z), the halo bias func- 
tion, bh{M,z), and the dark matter correlation function, 
(.dYn{r,z), all of which are cosmology dependent. In this pa- 
per we focus on a flat ACDM cosmology with a matter den- 
sity = 0.238, a baryonic matter density f2(, = 0.041, a 
Hubble parameter h = ffo/(100 kms"^ Mpc"^) = 0.734, 
a power-law initial power spectrum with spectral index 
Us = 0.951 and a normalization as = 0.744. These are the 
parameters that best-fit the 3-year WMAP data (Spergel 
et al. 2006), and we will refer to this cosmology in what 
follows as the WMAP3 cosmology. For comparison, we also 
compare some of our results to a flat ACDM cosmology with 
fim = 0.3, = 0.04, h = 0.7, n, = 1.0 and aa = 0.9. With 
strong support from the first year data release of the WMAP 
mission, this model has has been considered the concordance 



'cosmology' in the vast majority of all HOD studies. In what 
follows we will refer to a cosmology with these parameters 
as the WMAPl cosmology. 

Throughout this paper we compute the halo mass func- 
tion using the form suggested by Sheth, Mo & Tormen 
(2001), which has been shown to be in excellent agreement 
with numerical simulations as long as halo masses are de- 
fined as the masses inside a sphere with an average over- 
density of 180 (Jing 1998; Sheth & Tormen 1999; Jenkins et 
al. 2001; White 2002). Therefore, in what follows we consis- 
tently use that definition of halo mass when referring to M. 
The linear power spectrum of density perturbations is com- 
puted using the transfer function of Eisenstein & Hu (1998), 
which properly accounts for the baryons, while the evolved, 
non-linear power spectrum, required to compute the dark 
matter correlation function, is computed using the fitting 
formula of Smith et al. (2003). 

The left-hand panel of Fig. 1 plots the halo mass func- 
tions at « = for the WMAPl and WMAP3 cosmologies. 
Note that the new concordance cosmology predicts much 
fewer massive haloes: in fact, the number density of haloes 
with M = lO^^h'^ Mq (10"/i"^ M0) is only 19 percent (48 
percent) of what it is in the WMAPl cosmology. Clearly, 
all galaxies assigned to these haloes in the WMAPl HOD 
models now have to be redistributed over other haloes. The 
middle panel of Fig. 1 plots the halo bias at z = as func- 
tion of halo mass. Although the overall clustering strength of 
the dark matter is reduced in the WMAP3 cosmology with 
respect to that in the WMAPl cosmology (see Table 1), 
the halo bias has become larger. The difference is largest at 
M ~ 3 X W^^h~^ Mq, where the halo bias is ~ 1.5 times 
larger than in the WMAPl cosmology. 

As mentioned above, our halo masses M are defined as 
the masses inside a sphere with an average overdensity of 
180. Another definition of halo mass that is often adopted is 
the so-called virial mass, Mvir, which indicates the mass in- 
side a sphere with an average density equal to Avir times the 
critical density for closure. The value of Avir follows from 
the solution to the collapse of a spherical top-hat perturba- 
tion under the assumption that the halo has just virialized, 
and depends on cosmology through ilm (z) (Peebles 1980; see 
Bryan & Norman 1998 for a useful fitting function). Under 
the assumption that the density distribution of dark matter 
haloes is well fit by a NFW profile (Navarro, Frenk & White 
1997), one can convert M to Mvir (and vice versa) as long as 
one knows the halo concentration parameter Cvir- Using the 
Cvir (Mvir) of Macclo et al. (2006), we find that the relation 
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between M and Mvir is accurately fit (to better than one per- 
cent over the mass range W^h~^ Mq < M < 10^^h~^ Mq) 
by 



log 



Mv 



1012/1-1 M, 







= ao + aiy + 02j/ 



(15) 



with y = log[M/10^^/i ^ Mq]. Over the same mass range, 
the relation between Cvir and M (not Mvir) is accurately fit 

by 



= Co 



ciy + C2y 



(16) 



The best-fit parameters and Ci {i — 0, 1,2), for both the 
WMAPl and the WMAP3 cosmologies, are listed in Tabic 1. 
The resulting Cvir(M) are shown in the right-hand panel of 
Fig. 1: in the WMAP3 cosmology halo concentrations are 
~ 30 percent smaller than in the WMAPl cosmology. This 
may have a non-negligible impact on the two-point correla- 
tion function on small scales (r < 50h~^ kpc). In addition, 
this reduction in halo concentrations also has important im- 
plications for the TuUy-Fisher zeropoint (see discussions in 
van den Bosch et al. 2003b and Button et al. 2006). 



2.3 The Conditional Luminosity Function 

Following Yang et al. (2003) we parameterize the CLF by a 
Schechter function: 



$(L|M)dL 



I* 



exp(— L/L*) dL, 



(17) 



where L* = L*{M), a = a{M) and i>* = l>*(M) are all 
functions of halo mass M. We write the average, total mass- 
to-light ratio of a halo of mass M as 



{M/L)i 



2 \L Jo 



(18) 



This parameterization has four free parameters: a character- 
istic mass Ml, for which the mass-to- light ratio is equal to 
(M/L)o, and two slopes, 71 and 72, that specify the behavior 
of {M/L)m at the low and high mass ends, respectively. 

A similar parameterization is used for the characteristic 
luminosity L*{M): 



M 



L*{M) 



f).«^) 



vmT; '^\Ah) 



with 



fi&) 



r(a + 2) 
r(a + 1,1) 



(19) 



(20) 



Here T{x) is the Gamma function and r(a, x) the incomplete 
Gamma function. This parameterization has two additional 
free parameters: a characteristic mass M2 and a power-law 
slope 73. 

In our previous CLF studies, we used to set {M/L)m = 
(M/L)ci for haloes with M > Mq, with (M/L)ci a 

free parameter that describes the average mass-to-light ratio 
of clusters. This was motivated by a number of observational 
studies (Bahcall, Lubin & Norman 1995; Bahcall et al. 2000; 
Sanderson & Ponrnan 2003), which indicated that {M/L)m 
is roughly constant on the scale of gala^Ky clusters. This was 
further supported by semi-analytical models of galaxy for- 
mation, which revealed a similar behavior (see Fig. 14 in 



van den Bosch et al. 2003a). However, a number of stud- 
ies have suggested that the average mass-to-light ratio of 
clusters continues to increase with mass (Adami et al. 1998; 
Girardi et al. 2002; Marinoni & Hudson 2002; Bahcall & 
Comerford 2002; Lin, Mohr & Stanford 2003, 2004; Ramolla 
et al. 2004; Rines et al. 2004; Vale & Ostriker 2004, 2006; 
Popesso et al. 2005). Especially the more recent studies have 
convincingly shown that {M/L)m oc jv4"°-2±o.08 ^-^^ scale 
of clusters, virtually independent of the photometric band 
in which the luminosities arc measured (Popesso ct al. 2005, 
and references therein). In this paper, we therefore do not 
force {M/L)m to become constant at large M. Rather we 
simply adhere to the functional form of equation (18), ac- 
cording to which {M/L)m oc at large M. As wo will 
show below, this actually yields values for 72 that are in ex- 
cellent agreement with the cluster data discussed above. In 
order to allow for a comparison with our previous models, 
wc define {M /L)ci as the mass-to-light ratio for haloes with 
M = 10"/i-^ Mq. 

For a{M) we adopt a simple linear function of log(M), 



a(M) = ai5 -h 7? log(Mi5), 



(21) 



with Mis the halo mass in units of 10^^ /!.~^Mq, Q15 = 
q;(Mi6 = 1), and r? describes the change of the faint-end 
slope a with halo mass. Note that once a. and L* are given, 
the normalization $* of the CLF is obtained through equa- 
tion (18), using the fact that the total (average) luminosity 
in a halo of mass M is given by 



{L)m= $(L|M)LdL = **L*r(a-h2). 
Jo 



(22) 



Finally, we introduce the mass scale Mmin below which we 
set the CLF to zero; i.e., we assume that no stars form 
inside haloes with M < Mmin- Motivated by reionization 
considerations (see Yang et al. 2003 for details) we adopt 
Mmin = 10^/i~i M© throughout. Note, however, that this 
lower mass limit does not significantly influence our results. 
Changing Mmin to 10®/i"^ M© or 10^°/i"^ M© has only a 
very modest impact on the results presented below. 

As we will show below, this Schechter-function param- 
eterization of the CLF yields good fits to the data. In addi- 
tion, using galaxy group catalogues, Yang et al. (2005b) has 
shown that the Schechter form is also consistent with direct 
observations of the CLF. However, it is important to keep 
an open mind for alternative functional forms (see for exam- 
ple Zheng et al. 2005; Cooray & Milosavljevic 2005; Cooray 
2005a, 2006). 



2.4 Centrals and Satellites 

The CLF parameterization presented above does not distin- 
guish a priori between central and satellite galaxies. This 
is somewhat unfortunate, as there are good reasons to treat 
these kinds of galaxies separately (see Zheng et al. 2005, and 
references therein). Although it is straightforward to devise 
CLF parameterizations with a natural, a priori split in cen- 
tral and satellite galaxies (e.g., Cooray & Milosavljevic 2005; 
Cooray 2005a, 2006), wc stick to our Schechter-function pa- 
rameterization and apply an a posteriori split into central 
and satellite components. In particular, we assume that the 
central galaxy is always the brightest galaxy in a halo. 
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Table 2. Conditional luminosity function parameters. 
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Parameters of CLF models. Column (1) lists the ID by which we refer to each CLF in the text. Columns (2) to (10) list the CLF 
parameters obtained from the MCMCs. The upper two entries indicate the median and 68 percent confidence levels, the middle two rows 
list the parameters of the best-fit models, and the lower two rows list the parameters of three extreme models discussed in §5. Masses 
and mass-to-light ratios are in h'^^ Mq and h (M/L)q, respectively. 



Following van den Bosch ot al. (2004) we write the con- 
ditional probability distribution P(Lcen|A4")dLcen, with Lcen 
the luminosity of the central galaxy, as the product of the 
CLF and a new function /(Lcen, M) which depends on how 
galaxy luminosities are 'drawn' from the CLF: 



P(Lcen|M)dLcen = $ (L.en | M) /(L.en , M) dL^ 



(23) 



Since the CLF only gives the average number of galaxies 
with luminosities in the range L ± dL/2 in a halo of mass 
M, there are many different ways in which one can assign 
luminosities to the Ni galaxies of halo i and yet be con- 
sistent with the CLF. The simplest approach would be to 
simply draw Ni luminosities from $(L|M) and to associate 
Lcen with the luminosity of the brightest galaxy. Wo refer to 
this luminosity sampling as 'random', which results in the 
broadest P{Lccn\M) possible, at least when we adhere to 
the assumption that the central galaxy is the brightest halo 
galaxy. 

Alternatively, one could use a more constrained ap- 
proach, and, for instance, assume that the luminosity of the 
brightest (and hence central) galaxy is always larger than 
Li , defined by 



f 



*(L|M) AL=\ 



(24) 



Although Li = Li{M) is defined such that a halo has on 
average one galaxy with L > Li, demanding that Lcen > Li 
is equivalent to assuming that galcixy formation is somewhat 
deterministic and that the number of galaxies with L > 
Li(M) is always exactly one. Hereafter we shall refer to 
this sampling method as 'deterministic', which yields the 
narrowest P(Lcen|M) possible. 

In the case of 'deterministic' drawing one obviously has 

that 



/(Lcen,M) 



1 ifLcen>Ll(M) 
ifLcen<Li(M) 



(25) 



so that the expectation value for the luminosity of a central 
galaxy in a halo of mass M is simply given by 



(Lcen)il 



$(L|M) LdL = 3-* L* r(Q + 2, Li/L*) .(26) 



/(Lce„,M) 



exp(-C) 



with 



-f 

J he 



3>(L|M)dL 



(27) 



(28) 



In the case of 'random' drawing, one obtains that 



(sec Appendix B in van (ion Bosch ot al. 2004 for a deriva- 
tion), and the expectation value for Lcen has to be computed 
numerically. 

Unless specifically stated otherwise, in what follows we 
adopt the 'deterministic' sampling strategy because it allows 
various statistics of centrals and satellites to be computed 
analytically from the CLF. Most of the results do not sig- 
nificantly depend on this particular choice. Whenever the 
detailed form of /(Lcen,AL) is important, wo will present 
the results for both the 'deterministic' and the 'random' 
samplings. 



2.5 Parameter Fitting 

The CLF, as specified above, has a total of 8 free parameters: 
two characteristic masses; Mi and M2, four parameters that 
describe the various mass-dependencies 71, 72, 73 and r), 
a normalization for the mass-to-light ratio, {M/L)o, and a 
normalization of the faint-end slope, 015. The data that we 
uso to constrain the CLF consists of the 2dFGRS luminosity 
function of Madgwick et al. (2002) and the galaxy-galaxy 
correlation lengths as function of luminosity obtained from 
the 2dFGRS by Norberg et al. (2002). 

The LF of Madgwick et al. (2002) has been determined 
using the 2dFGRS data over the redshift range 0.01 < z < 
0.15, which we model using equations (4) and (5), with 
«min = 0.01 and Zmax = MIN[0.15, aiim(L)]. Here «iim(L) 
is the redshift at which the apparent magnitude of a galaxy 
of luminosity L is equal to the flux limit of the 2dFGRS, 
bj = 19.3 (CoUess et al. 2001). 

Norborg et al. (2002) defined a set of eight volume lim- 
ited samples, each defined by two luminosity limits, Lmin 
and Lniax, and two redshift limits, Zmin and Zmax- For each 
of these samples, they determined the correlation length, ro, 
defined by ^gg(T'o) = 1- We model this using equation (13) 
with 
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Figure 2. The data used to constrain the models (symbols plus errorbars), and the 68% and 95% confidence Umits from the WMAP3 
MCMC. The left-hand panel shows the galaxy luminosity function with the 2dFGRS data from Madgwick et al. (2002), while the right- 
hand panel shows the values of the galaxy-galaxy correlation function at the correlation lengths of the magnitude bins used by Norberg 
et al. (2002). For the data, these are unity by definition. Note that the model accurately fits the data. 



Dg.cff 



J^{N)m 6h.off(M) neff (M) dM 



(29) 



J^{N}Mn,B{M) dM 

where (A'') m is given by (1), but with Li and L2 replaced by 
the luminosity limits Lmin and Lmax of the volume limited 
sample under consideration. 

To determine the likelihood function of our free pa- 
rameters we use the Monte-Carlo Markov Chain (hereafter 
MCMC) technique (see e.g., Gamerman 1997). Each element 
of the chain is a model, consisting of 8 parameters. At any 
point in the chain we generate a new trial model by drawing 
the shifts in the eight free parameters from eight indepen- 
dent Gaussian distributions, centered on the current value 
of the corresponding model parameter. The probability of 
accepting the trial model is 



accept 



1.0 

exp[-(Xncw - Xold)/2] 



QCW 



if Xn, 
(^new 



< Xold 
Xnew ^ Xold 



Here = xl + Xro with 
$(LO-*(iO 



2 \ " 

X* = 



and 



2 

Xro 



1=1 ^ 



j=l 



A$(L,) 



Cgg(T'O,0 - 1 

A4g('-o,0 



(30) 



(31) 



(32) 



where ? indicates an observed quantity, and N,s> — 35 and 
Nr — 8 are the number of data points for the LF and the 
correlation lengths, respectively. Note that, by definition, 

ieg{ro,i) = 1. 

2.6 The Model 

Using the method described above we construct two chains 
consisting of 20 million models each, one for the WMAPl 



cosmology and another for the WMAP3 cosmology. Each 
chain is thinned by a factor 10** to remove the correlations 
between neighboring models (see van den Bosch et al. 2005a 
for details). The end result are two MCMCs consisting of 
2000 independent models each that properly sample the full 
posterior distributions. 

Fig 2 shows that the model based on the WMAP3 cos- 
mology accurately fits the galaxy LF and the galaxy corre- 
lation lengths as function of luminosity. The WMAPl cos- 
mology, however, yields an equally good fit to the data (not 
shown here, but see Fig. 3 in van den Bosch et al. 2005a). 
The fact that both cosmologies allow an equally good fit to 
these data, despite the large differences in halo mass func- 
tion and halo bias, illustrates that $(L) and ro{L) alone 
allow a fair amount of freedom in cosmological parameters 
(cf. van den Bosch et al. 2003b). However, as we will see 
below, the WMAPl and WMAP3 cosmologies predict sig- 
nificantly different mass-to-light ratios. 

Fig. 3 plots the posterior distributions of the CLE pa- 
rameters for both the WMAPl (red, unshaded histograms) 
and the WMAP3 (blue shaded histograms) MCMCs. The 
median and 68 percent confidence intervals of these distri- 
butions are listed in Table 2. A comparison of the WMAPl 
results presented here with those presented in van den Bosch 
et al. (2005a), shows small differences (all within the 68 per- 
cent confidence levels). These owe to the fact that (i) we use 
a new model for the halo bias, including its scale dependence, 
(ii) we properly model the data over its light-cone, and (iii) 
we no longer impose the constraint that the mass-to-light 
ratio is constant for haloes with M > 10^"* /i"^ M©. 

Comparing the WMAPl and WMAP3 results, one 
notes that several parameters, notably M2, a 15, 7;, 72 and 73, 
have virtually the same likelihood distributions for both cos- 
mologies. In the case of (M/L)ci, (M/L)o and Mi, however, 
the distributions for the WMAPl and WMAP3 cosmologies 
are clearly offset from each other. As we show below in more 
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Figure 3. Constraints on the nine CLF parameters obtained from our MCMCs. The shaded (blue) and non-shaded (red) histograms 
correspond to the WMAP3 and WMAPl cosmologies, respectively. The median and 68 percent confidence intervals of the distributions 
are listed in Table 2. Masses and mass-to-light ratios are in units of Mq and h{M/ L)q, respectively. 



detail, this mainly reflects the fact that the mass-to-light ra- 
tios predicted for the WMAP3 cosmology are significantly 
lower than for the WMAPl cosmology. 



3 RESULTS 

In what follows we use the MCMC presented above to make 
a number of predictions regarding the galaxy-dark matter 
connection. Where possible, we will compare these predic- 
tions to the results obtained from an analysis of a large 
catalogue of galaxy groups selected from the 2dFGRS using 
the halo-based galaxy group finder of Yang et al. (2005a). 
A short description of this group catalogue is presented in 
Appendix A. 



3.1 Mass-to-Light ratios 

As discussed above, the CLF allows one to compute the av- 
erage relation between fight and mass in the Universe. We 



present these results in terms of the average mass-to-light 
ratio as function of halo mass, {M/Lis)ai. Here Al is de- 
fined as the mass within a radius inside of which the average 
density is 180 times the background density, which can be 
converted into the virial mass Mvir using equation (15). The 
quantity Lis is the total luminosity in the photometric bj 
band of all galaxies brighter than Adtj — 5 log ft — —18.0, 
which is easy to obtain from the CLF. 

The upper panels of Fig. 4 show the 68 and 95 percent 
confidence limits on {A'I/Lis)m and Lccn(M) as obtained 
from our WMAP3 MCMC discussed above. The particular 
shape of {M/Lis}m holds important information regard- 
ing galaxy formation. For example, the pronounced mini- 
mum at A/ ~ 2 X 10^^h~^ Mq indicates the mass scale at 
which galaxy formation is most efficient. At lower masses 
(M/Lis) M increases dramatically, indicating that galaxy 
formation is unable to make galaxies with M(, ^ — 5 log h < 
— 18.0 in such low mass haloes. At the high mass end, the 
mass-to-light ratio also increases, though less rapidly, indi- 
cating that some processes, possibly including AGN feed- 
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Figure 4. Posterior constraints on a number of quantities computed from tlie WMAP3 MCMC. The contours show the 68% and 95% 
confidence limits from the marginalized distributions. Upper left-hand panel: The average ratio between M and Lis as function of halo 
mass. The (red) solid dots indicate the results from our 2dFGRS group catalogue (see Table Al), while the thick (black) line indicates 
the results from the 2PIGG group catalogue of Eke et al. (2004). Upper right-hand panel: The average relation between Lcen and M. 
Again, the (red) solid dots indicate the results obtained from our 2dFGRS group catalogue. Lower left-hand panel: The faint-end slope of 
the CLF, a, as function of halo mass M. Solid dots (red) and squares (black) correspond to the results obtained from our 2dFGRS group 
catalogue and the 2PIGG catalogue, respectively. Lower right-hand panel: The average satellite fraction as function of luminosity. Solid 
circles (red) and triangles (blue) indicate the satellite fractions of early and late type galaxies, respectively, obtained by Mandelbaum 
et al. (2006) from galaxy-galaxy lensing in the SDSS. The thick dashed line corresponds to the results obtained by TNWW (Tinker et 
al. 2006b) from an HOD analysis of the 2dFGRS, and the two thick crosses are the satellite fractions (and their 68% confidence limits) 
obtained by Cooray (2006) from a CLF analysis of the SDSS 



back, cause galaxy formation to also become relatively inef- 
ficient in massive haloes. 

The solid circles with errorbars correspond to the results 
obtained from our 2dFGRS group catalogue (Table Al), 
and are in excellent agreement with the CLF predictions. 
It is extremely reassuring that two completely different ap- 
proaches yield average mass-to-light ratios that are in such 



good agreement. Note that the errorbars indicate the ob- 
served scatter, not the error on the mean^. 

According to our parameterization, at the high mass 

^ Since the halo masses are estimated from the group luminosities 
(see Appendix A), this scatter is a lower limit on the true scatter. 
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end {M/Lis)m oc M"*^ . We obtain that 72 = 0.27i[;;'^^ (see 
Table 2), in good agreement with our 2dFGRS galaxy group 
results and with a wide range of additional studies (see 
§2.3). The solid (black) line, however, indicates the mass- 
to-liglit ratios obtained by Eke ct al. (2004) from their 2dF- 
GRS group catalogue called 2PIGG. Although the agree- 
ment with our CLF predictions and with the results from our 
2dFGRS group catalogue is good for M < 10^*h~^ Mq, the 
2PIGG-cataloguc yields that 72 ^ for M > lO^/i-^ Mq. 
Similar results wore obtained by Balicall, Lubiu & Nor- 
man (1995), Bahcall et al. (2000) and Sanderson & Pon- 
man (2003). To test whether the clustering data itself can 
discriminate between these results, we have constructed 
a CLF MCMC in which we set (M/L) = (Af/L)ci for 
M > 10"/r^ Mq. The resulting model can fit the observed 
$(L) and ro{L) equally well as the model presented here, in- 
dicating that the clustering data alone can not meaningfully 
constrain the slope of the relation between mass and light 
on the scale of clusters. The simple fact that two group cat- 
alogues constructed from the same data set (2dFGRS) yield 
predictions that are very different, accentuates the need for 
more thorough investigations. 

A comparison with Fig. 3 in van den Bosch et 
al. (2005a), shows that the mgiss-to-light ratios predicted by 
the CLF formalism are significantly lower in the WMAP3 
cosmology, compared to the WMAPl cosmology. This dif- 
ference is most pronounced near the minimum (M ~ 3 x 
10"/i"^ Mq, where the WMAPl cosmology predicts mass- 
to-light ratios that are ~ 45 percent higher. At the mas- 
sive end the difference is less pronounced, but at M = 
10"/i"^ Mq the WMAPl mass-to-light ratios are still ~ 25 
percent higher than for the WMAP3 cosmology. The reason 
for this change is a rather complicated mix of effects. First 
of all, in the WMAP3 cosmology there are much fewer mas- 
sive haloes. Secondly, changing the cosmology from WMAPl 
to WMAP3 decreases the dark matter correlation length 
(see Table 1). Consequently, galaxies have to become more 
strongly biased in order to match their observed correlation 
lengths. To some extent, this is automatically achieved by 
the fact that the halo bias is larger in the WMAP3 cosmol- 
ogy (cf. Fig. 1). However, since the strength of this effect 
is strongly dependent on halo mass, it is difficult to make 
intuitive predictions. Our analysis shows that all these ef- 
fects conspire to cause the average mass-to-light ratios to 
decrease on all mass scales. 

The fact that the average mass-to-light ratios are cos- 
mology dependent is of great importance. As we have shown 
in van den Bosch et al. (2003b, hereafter BMY03), it allows 
us to put tight constraints on cosmological parameters, in 
particular on Qrn and as- In principle, there is a wide range 
of cosmologies that allow one to accurately fit both $(L) and 
ro{L). Changing the cosmology typically implies a change 
in the halo bias and in the overall clustering strength of the 
dark matter. In order to maintain a good fit to the observed 
ro(L) one has to redistribute galaxies over haloes of different 
masses in order to counterbalance the changes in bh{M) and 
^dm(r). As long as these changes are not too large, one can 
always find a characteristic halo mass that has the right bias 
so that T\){L) is consistent with the data. However, all these 
different models will predict different mass-to-light ratios 
{M/L}m, simply because they require different halo occu- 
pation statistics. Therefore, any constraints on the average 



mass-to- light ratios of dark matter haloes, on any mass scale, 
will dramatically tighten the constraints on cosmological pa- 
rameters. 

One such constraint comes from galaxy clusters. Numer- 
ous studies, based on different techniques, have measured 
the mass-to-light ratios of clusters of galaxies (e.g, Carlberg 
et al. 1996 Bahcall et al. 2000; Lin, Mohr & Stanford 2004; 
Popesso et al. 2005). As shown in BMY03, all these measure- 
ments are in good agreement with each other and suggest 
that (Mvir/Ls)ci = (350 ± 70)h (M/L)©. Using this as a 
constraint on the CLF models puts tight constraints on the 
cosmological parameters. In fact, combining the CLF analy- 
sis with the first year WMAP results, BMY03 obtained that 
nm = 0.25l:o.o7 and ag = 0.78 ±0.12 (both 95% CL), in ex- 
cellent agreement with the 3-year results from the WMAP 
mission (Spergel et al. 2006). The main problem with the 
WMAPl cosmology is that it predicts mass-to- light ratios 
for clusters that arc too large (see also Tinker et al. 2005; 
Vale & Ostrikcr 2006). The good agreement, both among 
these different studies and with the latest CMB constraints, 
demonstrates the strength and reliability of the CLF formal- 
ism (or equivalent techniques), especially when combined 
with constraints on mass-to-light ratios. It indicates that 
halo occupation modeling has matured to the point where 
it can be used to obtain tight and reliable constraints on 
cosmological parameters (see also Zheng & Weinberg 2005). 



3.2 Faint End Slope of CLF 

The lower left-hand panel of Fig. 4 shows that our CLF 
model predicts that the faint-end slope of the CLF, a, de- 
creases with increasing halo mass. At around the cluster 
scale the models favor fairly steep faint-end slopes with 
a ~ —1.2 ± 0.3 (68% CL), in good agreement with inde- 
pendent studies of the luminosity functions of individual 
clusters (e.g., Sandage, Bingelli & Tammann 1985; Bcijers- 
bergen et al. 2002; Trcntham & Hodgkin 2002; Trentham & 
Tully 2002). Note, however, that since the CLF reflects the 
average luminosity function for haloes of a given mass, it is 
not necessarily a good description of the luminosity func- 
tions in individual systems. A more meaningful comparison 
is therefore with the CLFs that one can obtain directly from 
galax;y group catalogues, by combining all groups in a rela- 
tively narrow mass bin. 

The solid squares indicate the faint-end slope of the 
CLF thus obtained by Eke et al. (2004) from the 2PIGG 
group catalogue (see also Robotham et al. 2006), while the 
(red) solid dots correspond to the results obtained from 
our 2dFGRS group catalogue (see Yang et al. 2005b) . Both 
results are in reasonable agreement with each other and 
with the confidence limits obtained from our CLF analy- 
sis (though the latter are not very strict). Note that our 
results only extent to M <, W^*h~^ Mq; in order to obtain 
a sufficiently large number of massive groups, one needs to 
probe out to relatively high redsliifts. However, because of 
the flux-limit of the 2dFGRS we can not probe the CLF of 
these groups to sufficiently low luminosities to be able to 
extract a reliable measure of the faint-end slope. 
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3.3 Satellite Fractions 

The satellite fraction as function of luminosity, fsa,t{L), is 
an important quantity for a proper interpretation of galaxy- 
galaxy lensing measurements (Guzik & Seljak 2002; Man- 
delbaum et al. 2006; Yang et al. 2006) and for understand- 
ing pairwisc velocity dispersions (Slosar, Seljak & Tasitsiomi 
2006; see also §5). Therefore, it is useful to check what our 
CLF models predict in terms of /sat(i)- 

A satellite galaxy most likely resides in a more massive 
halo than a central galaxy of the same luminosity. Since halo 
bias depends on halo mass, the large scale bias of galaxies 
of a given luminosity depends strongly on what fraction of 
them are satellites. Consequently, the observed clustering 
strength as function of luminosity can put strong constraints 
on fss.t{L). In the 'deterministic' case (see §2.4), the satellite 
fraction as function of luminosity is given by 

/sat(L) = ^L}J^ ^WM) n{M) dM (33) 

with Mone defined according to Li{Mone) = L. In words, 
Mone is the mass scale at which one has exactly one galaxy 
brighter than L, which is easily computed from eq. (24) using 
a root finder. The lower right-hand panel of Fig. 4 shows 
the 68 and 95 percent confidence levels on /sat (L) computed 
from our CLF MCMC. The satellite fraction decreases with 
increasing luminosity, from 32 ± 6 percent at Mbj — 51og/i = 
-17 to 11 ± 4 percent at Mt , - 51og/i = -21 (both 68% 
CL). In the WMAPl cosmology the satellite fractions are 
about -|-5 percent higher, which is a small (but systematic) 
diflference compared to the model uncertainties for a given 
cosmology. 

For comparison, we ovorplot the results from three dif- 
ferent studios. The dashed line shows the satellite frac- 
tions corresponding to the fiducial HOD model of Tinker 
et al. (2006b, hereafter TNWW). This model is constrained 
by the luminosity dependence of the clustering strength in 
the 2dFGRS. The two crosses indicate the satellite fractions 
inferred by Cooray (2006) from a CLF analysis of the Sloan 
Digital Sky Survey (hereafter SDSS), with vertical error- 
bars indicating the 68% confidence levels. Finally, the solid 
circles and triangles with vertical errorbars (68% CL) indi- 
cate the satellite fractions of early- and late-type galaxies, 
respectively, inferred by Mandelbaum et al. (2006) from a 

galaxy-galaxy lensing analysis of the SDSS*. Remarkably, 
all these results arc in good agreement with each other and 
with our CLF constraints. 



3.4 Fossil Groups and the Luminosity-Gap 
Statistic 

Another useful statistic is the ratio L2/L1 of the luminosities 
of the second brightest and brightest galaxies in a given halo. 
As discussed in D'Onghia et al. (2005) and Milosavljevic et 
al. (2006) this statistic quantifies the dynamical age of a 
system of galaxies: haloes with L2/L1 close to unity must 

i For the results of Mandelbaum ct al. (2006) and Cooray (2006) 
we have converted the SDSS r-band luminosities to the 6j-band 
using the simplifying assumption that the luminosity ratio L/L* 
is independent of the photometric band. 



be relatively young, as dynamical friction will cause multiple 
luminous galaxies in the same halo to merge on a relatively 
short time scale. 

In Fig. 5 we compare this 'luminosity-gap' statistic 
obtained from our 2dFGRS group catalogue (shaded his- 
tograms) to results obtained from our CLF. To that extent 
we populate the dark matter haloes in a 300h-^ Mpc cos- 
mological siniTdation box (see Appendix B) with galaxies 
according to our best-fit CLF model (called WMAPSa in 
Table 2) using both the 'deterministic' and the 'random' 
formalisms described in §2.4. The results are shown as the 
dashed (green) histograms in the upper and lower panels, 
respectively. Both models clearly predict that the average 
luminosity gap increases with decreasing halo mass, in qual- 
itative agreement with the data. While the 'deterministic' 
model predicts that there is a deficit of low mass haloes 
with L2/L1 ~ 1, the 'random' model always predicts distri- 
butions of L2/L1 that peak at unity. Especially for haloes 
with 12 < log[M/{tr^ Mq)] < 13, the latter seems to be in 
better agreement with the data. However, the comparison is 
not entirely fair. After all, the data has been obtained from a 
group catalogue, which suffers from interlopers, incomplete- 
ness and errors in halo mass. 

We therefore use the populated simulation box to con- 
struct a mock galaxy redshift survey (as described in de- 
tail in Appendix B) to which we apply the group finder of 
Yang et al. (2005a). The solid (red) histograms in Fig. 5 
show the luminosity-gap statistic obtained from these mock 
group catalogues. Overall, the agreement between the model 
and the data is remarkably good, indicating that our CLF 
model predicts realistic values of L2/L1. Somewhat unfortu- 
nately, the differences between the 'deterministic' and 'ran- 
dom' models are now much suppressed, so that it is no longer 
possible to clearly discriminate between these two models. 
Note that the mock group catalogue yields a distribution of 
log[L2/ii] in the lowest halo mass bin which is very different 
from the true underlying distributions (dashed histograms), 
indicating that these low-mass haloes suffer quite substan- 
tially from interlopers, incompleteness effects and errors in 
halo mass. For the more massive haloes, however, the agree- 
ment between the true distributions and those obtained from 
the mock group catalogue is very satisfactory. 

Systems with a relatively large luminosity gap, which 
most likely owes to the fact that the brightest galaxies in the 
halo have merged, are often termed "fossil groups" and have 
received a significant amount of attention in the recent liter- 
ature (see Vikhlinin et al. 1999; Jones et al. 2003; D'Onghia 
et al. 2005; Milosavljevic et al. 2006; and references therein). 
Following Jones et al. (2003) and Milosavljevic et al. (2006) 
we define systems in which the second brightest galaxy is at 
least 2 magnitudes fainter than the brightest galaxy (i.e., 
log[L2/ii] < —0.8, indicated as dotted vertical lines in 
Fig. 5) as "fossil" systems. With our 2dFGRS group cata- 
logue, we are in a unique position to determine the fraction 
of fossil groups from a large and complete sample of opti- 
cally selected galaxy groups. We obtain that the fraction of 
fossil systems increases with decreasing halo mass from 3.6 
percent for groups with 14 < log[M/(/i"^ M©)] < 15 to 6.5 
percent for groups with 13 < log[M/{h~^ M©)] < 14 to 13.4 
percent for groups with 12 < log[M/(/i"^ M©)] < 13 (in all 
three cases the Poissonian errors are less than 0.1 percent). 
For comparison, Jones et al. (2003) obtained an incidence 
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Figure 5. The luminosity-gap statistic: Tlie (blue) shaded histograms indicate the distribution of log[L2/ii] obtained from our 2dFGRS 
group catalogue for four different bins in halo (group) mass, as indicated (all masses are in Mq). Here Li is the luminosity of the i"^ 
brightest galaxy in a given group (or halo). The dashed (green) histograms show the results obtained from our best-fit CLF model, when 
using either the 'deterministic' (upper panels) or 'random' (lower panels) sampling strategy. The solid (red) histograms show the results 
obtained from mock group catalogues constructed from these CLF models. These can be compared directly to the 2dFGRS results. Note 
that the overall agreement is very satisfactory, both for the 'deterministic' and for the 'random' sampling strategy. The vertical dashed 
lines correspond to log[L2/i'i] = —0.8 and mark the point where the second brightest galaxy is exactly two magnitudes fainter than the 
brightest galaxy. Groups to the left of this line are sometimes referred to as "fossil groups" . 



rate of 8 to 20 percent for systems with an X-ray luminosity 
from diffuse, hot gas of I/x,boi > 2.5 x fO*^/i~^ergs"^. Al- 
though this is relatively high compared to the fossil fractions 
in our 2dFGRS group catalogue, the latter have not been 
X-ray selected which complicates a straightforward compar- 
ison. In a recent paper, D'Onghia et al. (2005) used detailed 
hydrodynamical simulations to predict the fraction of haloes 
with M ~ 10"/i"^Mq that have logfLa/ii] < -0.8. From a 
total of twelve simulated groups, they obtain a fossil fraction 
of 33 ± 16 percent. Although consistent with ours at their 
2a level, the much lower fraction of fossil systems in our 
2dFGRS group catalogue suggests a potential over-merging 
problem in their simulations. 



4 HALO OCCUPATION STATISTICS 

In this section we describe the link between the CLF and the 
more often used HOD models. The latter aim at constraining 
the conditional probability distribution P[N\M) that a halo 
of mass M contains A'^ galaxies. Here, and in what follows, 
whenever we talk about the occupation numbers A'^, we mean 
the number of galaxies brighter than a given luminosity limit 
Lmin- Most studies to date have only focused on the first 
moment of P{N\M), which specifies the mean occupation 
numbers as function of halo mass. The same information 
can be extracted trivially from the CLF, for any Lmin, so 
that the relation between P{N\M) and <I>(L|M) is given by 



{N)m = P(N\M) = / ^{L\M)AL (34) 

It is interesting to compare the shape of this {N)m pre- 
dicted by the CLF, with the shape that is typically assumed 
in HOD models. Early HOD models often assumed that 
{N)m follows a simple power-law (Jing, Mo & Borner 1998; 
Seljak 2000; Scoccimarro et al. 2001; Scranton 2002; Rozo, 
Dodelson & Frieman 2004; Colhster & Lahav 2005) or a 
broken power-law (Berlind & Weinberg 2002; Magliocchetti 
& Porciani 2003). More recently, it has become practice to 
adopt a somewhat more complicated form, motivated by 
a separate treatment of central and satellite galaxies, i.e., 
{N)m = {Ncen)M + (A"aat)Af (e.g., Abazajiau et al. 2005; 
Sefusatti & Scoccimarro 2005; Zehavi et al. 2004; Zheng et 
al. 2005; Tinker, Weinberg & Zheng 2006a). In all these mod- 
els (A'sat)A/ is modeled as a power- law (sometimes with a 
break at small M), while {Nccn}M is considered to change 
from zero at low M to unity at high M, either via a sim- 
ple step function at a characteristic mass, or via a some- 
what broader transition function. These functional forms 
are largely motivated by the occupation statistics of dark 
matter subhaloes and of galaxies in numerical simulations 
and semi-analytical models (Kauffmann et al. 1999; Ben- 
son et al. 2000; Sheth & Diaferio 2001; Berlind et al. 2003; 
Kravtsov et al. 2004; Zheng et al. 2005). 

In the case of our CLF, no assumptions are made re- 
garding the functional forms of either (Ncen)M or {Nsat)M- 
In fact, we split the Schechter-function CLF a posteriori in 
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Figure 6. Occupation statistics of our best-fit CLF model in the WMAP3 cosmology. The upper panels plot the average number of 
galaxies brighter than a certain magnitude limit as function of halo mass. Prom left to right the upper panels show the occupation 
numbers for all galaxies, for central galaxies, and for satellite galaxies. In each panel, the different lines correspond to different magnitude 
limits: from left to right, Mi,j — 51ogh = —14, —15, —21. The lower panels show the combined occupation numbers for three different 
magnitude limits, as indicated. These figures illustrate that the functional form of the occupation numbers predicted by our CLF models 
are in qualitative agreement with HOD models, with numerical simulations, and with semi-analytical models for galaxy formation, but 
with one important difference: the zero-to-unity transition of (A^cen)M is significantly broader than in most HOD models (see text for 
detailed discussion). 



contributions from central and satellite galaxies. Following 
the 'deterministic' method described in §2.4, the occupa- 
tion statistics of central and satellite galaxies follow trivially 
from (N): if (A^) > 1 than (iVccn) = 1 and {iV^.t) = (N) - 1. 
On the other hand, if (iV) < 1 than (Nccn) = (iV) and 

(iV.at) = 0. 

With one additional assumption, one can in fact derive 
the full probability distribution P{N\M) from the CLF. Mo- 
tivated by the fact that dark matter subhaloes reveal Pois- 
sonian statistics (Kravtsov et al. 2004), it has become stan- 
dard to assume that the number of satellite galaxies follows 
a Poisson distribution. If we follow this assumption, which 
is also the standard procedure in HOD models, we have that 



P(iV|M) 



Pi(iV|A/) ifM>A'/o 
P2{N\M) otherwise 



(35) 



Here Mono is defined in §3.3, 

r if iV = 

MN\M) = ^ iiW^,,p[_^;V...)] otherwise 

and 



(36) 



r l-{iVccn) ifiV = 
P2(iVlM) = <^ (TVccn) ifiV=l 

I otherwise 



(37) 



The upper left-hand panel of Fig. 6 plots the average 
number of galaxies brighter than a given magnitude limit 
as function of halo mass. The magnitude limits are, from 
left to right, Mtj — 5\ogh — — 14, — 15, — 21. Results are 
only shown for the best-fit model from the WMAP3 MCMC, 
though the overall trends are qualitatively the same for all 
other models, including those for the WMAPl cosmology. At 
bright magnitude limits, (A*') m is close to a pure power-law. 
At fainter magnitude limits it starts to develop a low-mass 
shoulder, which, at the faintest magnitude limits, evolves 
into a separate peak. The upper middle and upper right- 
hand panels plot the corresponding occupation statistics of 
central and satellite galaxies, respectively (see §2.4). The 
lower panels plot {N)m, plus the contributions from central 
(dotted lines) and satellite (dashed lines) galaxies, for three 
magnitude limits, as indicated. Clearly, the functional forms 
of these occupation numbers are in qualitative agreement 
with the functional forms discussed above: (Ncan) m transits 
from zero at low M to unity at large M, while {Nsax) m is 
well approximated by a power-law at large M with an (ex- 
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Figure 7. The contours show the 68% and 95% confidence hmits on the halo occupation numbers obtained from our WMAP3 MCMC. 
Results are shown for three different magnitude limits, as indicated, and are compared to the results obtained from our 2dFGRS group 
catalogue (red, solid dots, see Table Al). 



ponential) truncation at low M. However, a more detailed 
comparison shows that the (A'^) m of TNWW reveal more 
pronounced shoulders at {N)m = 1 with a sharper trunca- 
tion at low M . The sharpness of this zero-to-unity transition 
is a measure for the scatter in the conditional probability 
function P(M|Lccn)) which is the topic of discussion in §4.2 

For the faintest magnitude limits considered here {N)m 
reveals a small 'bump' at low M. Although the presence of 
this bump is only marginally significant, in that there are 
also models within the 68 percent confidence levels of the 
MCMC that do not reveal such a bump, a similar feature 
was obtained by Magliocchetti & Porciani (2003) from an 
HOD analysis of the 2dFGRS. It thus seems that the data 
has a weak preference for such a feature, which implies that 
a significant fraction of the low mass haloes host a satellite 
galajcy that is virtually equally bright as the central galaxy. 
Interestingly, SPH simulations and semi-analytical models 
for galaxy formation reveal similar 'bumps' in the halo oc- 
cupation statistics, especially for blue galaxies (Sheth & Di- 
aferio 2001; Scranton 2003; Zheng et al. 2005). 

Finally, in Fig. 7 we compare the halo occupation num- 
bers predicted from our CLF MCMC with those obtained 
from our 2dFGRS group catalogue (cf. Table Al). Overall 
the agreement is satisfactory, especially for brighter mag- 
nitude limits. The group catalogue predicts a fiattening of 
{N)m at {N)m ~ 1, in disagreement with our CLF predic- 
tions. However, this is an artefact of the group finder: one 
can detect "groups" with A'' = 1, but not those with A'^ = 
(cf. Yang et al. 2005b). 



not depend strongly on the value of the magnitude limit 
used. The sudden dramatic increase of the confidence lim- 
its at Alb J — 5 log ft = — 20 owes to the fact that for some 
of the CLF models in the MCMC even the most massive 
haloes considered (M = 10^^h~^ ^q) have fewer than three 
satellites with Mbj — 5 log h < —20, so that 7 is not defined. 

The (red) solid dots correspond to the results obtained 
from our 2dFGRS group catalogue, also measured at A'^sat = 
3, and are in good agreement with the CLF constraints. 
The (green) horizontal bar indicates the constraints on 7 
obtained by TNWW from an HOD analysis of the 2dFGRS. 
Under the assumption that 7 is independent of the luminos- 
ity limit they obtained 7 = 1.03±0.03 (68% CL). This value 
is consistent with our CLF predictions at the 1 to 2cr level, 
but significantly higher than what we obtained from our 
2dFGRS group catalogue. The same applies to the (black) 
solid squares, which indicate the slopes of the occupation 
statistics of CDM sub-haloes. These have been obtained by 
Kravtsov et al. (2004) for five different number densities 
of CDM haloes in a large numerical simulation. We have 
converted these number densities to a bj band magnitude 
limit, using the 2dFGRS luminosity function of Madgwick 
et al. (2002). 

Finally, we emphasize that these comparisons have to be 
interpreted with some care. After all, our {A'aat)Af are not 
pure power-laws, neither for the CLF predictions, nor for 
the occupation statistics obtained from the 2dFGRS group 
catalogue. Consequently, the results depend somewhat on 
the value of A'sat at which the slope is measured. 



4.1 Power-law slopes 

To make our predictions regarding the occupation numbers 
somewhat more quantitative, we use our WMAP3 MCMC 
to compute the slope of {A^sat)jv/ at the high-mass end. 
Fig. 8 plots the 68 and 95 percent confidence limits on 
7 = dlog{A''sat)A//dlogM measured at {A'sat)A/ = 3 as 
function of the magnitude limit. This shows that there is 
a fairly large uncertainty on 7, especially for faint magni- 
tude limits. In addition, it shows that the value of 7 does 



4.2 Scatter in the relation between Lccn and M 

As mentioned above, the occupation statistics of TNWW 
seem to predict significantly sharper zero-to-unity transi- 
tions for {Ncen}M, which implies significantly less scatter in 
P(A/|I/con). The width of this conditional probability dis- 
tribution is interesting from the perspective of galaxy for- 
mation, as it contains information regarding the amount of 
stochasticity in galaxy formation. It is also important for 
a proper interpretation of galaxy-galaxy lensing measure- 
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Figure 8. The slope 7 = d log(Afsat)/d log M, measured at 
(A^sat) = 3, as function of the magnitude limit. Overplotted for 
comparison are the results obtained from the 2dFGRS group cat- 
alogue of Yang et al. (2005a), the results of Tinker et al. (2006b) 
obtained from a HOD analysis of the 2dFGRS, and the results 
obtained for dark matter subhaloes by Kravtsov et al. (2006b). 



ments (Mandelbaum et al. 2005) and of the kinematics of 
satellite galaxies (van den Bosch et al. 2004). 

We can use the CLF to compute the variance in log M 
of haloes that host a central galaxy of luminosity Lccn ■ This 
is given by 



2. M2 fMiV 

a log M] = — — — 

Mo \MoJ 



with 



Mk 



P(Lccn\M) [log Mr n{M) dM 



(38) 



(39) 



and with P{Lccn\M) given by equation (23). Here we have 
used that 



P(Af|Lccn)dM = 



P(L,en|M)n(M) 

/"P(iccn|Af)n(M)dM 



dM, 



(40) 



which follows from Bayes' theorem. 

Fig. 9 plots the 68 and 95 percent confidence limits 
on o-iogM(I'cen) obtained from our WMAP3 MCMC us- 
ing the "deterministic" (upper panel) and "random" (lower 
panel) P{Lccn\M). The CLF model predicts a pronounced 
increase of aiogjv/ with increasing Lccn- This is consistent 
with Fig. 6, which clearly shows that the zero-to-unity tran- 
sition of (Ncen) becomes less sharp for brighter magnitude 
limits. As expected, the scatter in P{M\Lcen) is higher in 
the "random" case compared to the "deterministic" case, 
especially at the faint end. 

The dashed lines in Fig. 9 indicate criogM(Lccn) for the 
B-band CLF from Cooray (2006) computed using (38). Al- 
though Cooray assumes a different functional form for the 
CLF, and uses a very different technique to constrain the 
CLF, the agreement with our results is remarkably good. 



The (red) solid dots show the results obtained by 
TNWW from an HOD analysis of the 2dFGRS. For Leon < 
W^°h~'^ L© TNWW have assumed that ai^gM = 0.15. For 
brighter centrals, however, they let criogM be a free parame- 
ter. Their best-fit values show a strong increase of cio^m 
with increasing Lcen- Although in qualitative agreement 
with our results and those of Cooray (2006), their values 
for (TiogA/ are much smaller. Unfortunately, TNWW do not 
give any uncertainties on their best- fit o"iogM, so that it is 
difficult to judge the significance of this difference. 

Given the relevance of the amount of scatter in 
P{M\Lccn) for, for example, weak lensing, it is important 
to try to obtain more direct constraints on uiogM- In More 
et al. (2006, in preparation), we use satellite kinematics to 

show that (Tlog M(Lccn) > 0.2 for icon > 3 X 10^h~^ Lq, 

which clearly rules out the relatively small scatter obtained 
(and assumed) by TNWW. It is unclear at present why these 
authors obtain a cric,gjv/(I'cen) that is so much smaller. For 
example, as shown in Zehavi et al. (2005), one can change 
the sharpness of the zero-to-unity transition of {Nccn)M and 
leave the fit to the galaxy-galaxy correlation function largely 
intact. 



5 PAIRWISE VELOCITY DISPERSIONS 

The peculiar velocities of galaxies are determined by the 
action of the gravitational field, and are therefore directly 
related to the matter distribution in the Universe. Conse- 
quently, the amplitude of galaxy peculiar velocities can yield 
useful, additional information regarding the universal rela- 
tion between light and mass. One statistic that is particu- 
larly useful in this respect is the pairwise velocity dispersion 
(PVD), (712 (r), which is a measure of the relative peculiar 
velocity of a pair of galaxies as a function of their separation 
r. The PVDs can be obtained from the data as described in 
§5.1 below. 

In Yang et al. (2004) we used detailed mock galaxy 
redshift surveys (hereafter MGRSs, see Appendix B) con- 
structed using our WMAPl CLF in order to investigate 
what these CLF models predict for the PVDs of 2dF- 
GRS galaxies. A comparison with the results of Hawkins 
et al. (2003) revealed that our MGRS based on the best- 
fit CLF dramatically over-predicts the PVDs at scales of 
~ Ih'^ Mpc by ~ 350 kms~^. Since the PVD is extremely 
sensitive to the few richest systems in the sample (i.e., Mo, 
Jing & Borner 1993) one can lower the PVDs by lowering 
the occupation numbers of massive haloes. Within the un- 
certainties of the CLF parameters we were able to find a 
model that could reproduce the observed PVDs. However, 
that model predicts an average mass-to-light ratio for clus- 
ters of ~ lOOO/i Mq/ L0, which is much larger (by more 
than 7a) than the average mass-to-light ratio obtained from 
other, independent measurements (see §3.1). 

Rather than lowering the average number of galax- 
ies per cluster, one can also lower the PVDs by reducing 
the abundance of massive haloes. This, however, implies a 
change of cosmology. In Yang et al. (2004) we showed that 
a flat ACDM cosmology with flm ~ 0.3 and erg = 0.75 
could accurately reproduce the observed PVDs with a re- 
alistic mass-to-light ratio for clusters. In fact, we used this 
result to argue against the WMAPl cosmology and in favor 
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Figure 9. The contours show the 68% and 95% confidence Hm- 
its on the standard deviation in log M as function of L 

ccn 5 as 

obtained from our WMAP3 MCMC. This quantity expresses the 
width of the conditional probabiUty distribution P(M|Lccn). Up- 
per and lower panels show the results for the "deterministic" and 
"random" methods, respectively. For comparison, we also show 
the results obtained by Tinker et al. (2006b), from an HOD anal- 
ysis of the 2dFGRS (red, solid dots), and by Cooray (2006) from 
a CLF analysis (dashed line). Note that both CLF studies predict 
a much broader P(M|Lcon) than the HOD study of Tinker et al. . 



of a cosmology with a reduced erg (see also van den Bosch 
et al. 2003b, 2005a and Yang et al. 2005a). 

The PVDs of Hawkins et al. (2003) were obtained from 
a large flux-limited sample. Although these already provide 
important constraints on the mass-to-light ratios of clusters 
(for a given cosmology), one can obtain even tighter con- 
straints on the universal relation between light and mass by 
measuring the PVDs as a function of galaxy luminosity. Jing 
& Borner (2004, hereafter JB04) were the first to present a 
PVD analysis for galaxies in different luminosity intervals. 
Using the 2dFGRS, they found that the PVD at a scale of 
k = IftMpc"^ has a minimum of ~ 425 kms^^ for galaxies 
with Mi,_, — 51og/i ~ —20.5. Fainter galaxies, with mag- 
nitudes in the range [—17,-19], were found to have much 
higher pairwise velocity dispersions of ~ 700 km s"'^, almost 



as high as those of the brightest galaxies in the 2dFGRS 
(see red, solid triangles in Fig. 10). This indicates that a 
significant fraction of the fainter galaxies must be satellite 
galaxies in massive haloes. 

A qualitatively similar result has been obtained by Li 
et al. (2006; hereafter LOG) from an analysis of the SDSS. 
In their case, however, a\2{L) measured at fc = l/i Mpc~^ 
reveals a smaller dynamic range; the minimum occurs at 
~ 500 kms^^ while the PVDs for the fainter galaxies are 
<, 600kms~^ (see blue, solid squares in Fig. 10). 

Using one of the CLF models presented in Yang et 
al. (2003), JB04 constructed a mock 2dFGRS which they 
analyzed in exactly the same way as the 2dFGRS data. Con- 
trary to the data, the model PVDs were found to increases 
monotonically from ~ 400kms~^ at Mbj — 51og/i — —17 to 
~ 750 km s"^ at Mb , - 5 log ft = -21. The CLF model thus 
severely underpredicts the PVDs of faint galaxies, and does 
not reveal the pronounced minimum near Mi,j — 5 log h ~ 
—20.5. However, there is a considerable amount of freedom 
in the CLF parameters. For example, as is evident from 
Figs. 3 and 4, within the 95 percent confidence limits there is 
a wide range of cluster mass-to- light ratios and satellite frac- 
tions that can fit both ^{L) and ro{L). Since the PVDs are 
extremely sensitive to these quantities (e.g., Mo et al. 1993; 
Slosar et al. 2006; Tinker 2006c), it is crucial that one takes 
this model freedom into account when comparing model and 
data. Here we will do so, analysing the luminosity depen- 
dence of the PVDs for our new WMAP3 CLF. 



5.1 The Luminosity Dependence of the PVDs in 
the 2dFGRS 

We start by performing our own analysis of the luminos- 
ity dependence of the PVDs in the 2dFGRS. First we select 
those galaxies in the final release of the 2dFGRS that are lo- 
cated in the North Galactic Pole (NGP) and South Galactic 
Pole (SGP) survey strips with a redshift quality parameter 
Q > 3, with 0.01 < z < 0.20, and with a redshift complete- 
ness > 0.7. These are used to construct ten volume-limited 
samples (adopting bj = 19.3 as the apparent magnitude 
limit of the 2dFGRS) whose magnitude and redshift limits 
are indicated in Table 3. 

Let Vp and n be the pair separations perpendicular and 
parallel to the line-of-sight, respectively. For each of our 
volume-limited samples we compute the two-point corre- 
lation function (2PCF) ^(rp,7r), using the estimator intro- 
duced by Hamilton (1993). Random samples are constructed 
from our MGRSs (see Appendix B) by randomizing the co- 
ordinates of all mock galaxies. We use this two-dimensional 
2PCF to compute the PVD from the galaxy power spectrum 
in redshift space, P''^\k,fi), which is related to the power 
spectrum in real space, P{k) according to 

P'''\k,fi) = P{k){l + /3fiyD[kfiai2{k)] (41) 

(Peacock & Dodds 1994; Cole, Fisher & Weinberg 1995) with 
k the wavenumber and /i the cosine of the angle between 
the wavevector and the line-of-sight. The factor (1-1- P/i^)^ 
accounts for the compression due to infall, with /? the lin- 
ear redshift-distortion parameter, while D[kfj.ai2{k)] is the 
damping function that accounts for the random motion of 
galaxies within dark matter haloes. We follow JB04 and LOG 
and assume that this damping function has a Lorentz form 
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Table 3. Pairwise Velocity Dispersions 



Magn. Limits 


Median Magn. 




■2^max 


N 


(Ti2(fc = 1/iMpc-i) 


Ml, J - 5 log h 


Mb , - 5 log h 








kms~^ 


(-17.5, -16.5] 


-16.99 


0.01 


0.05 


4892 


507 ± 122 


(—18.0, —17.0] 


— 17.48 


0.01 


0.06 


8144 


532 ± 82 


(-18.5, -17.5] 


-17.99 


0.01 


0.07 


12525 


592 ± 38 


(-19.0, -18.0] 


-18.49 


0.01 


0.09 


24334 


574 ± 52 


(-19.5,-18.5] 


-18.96 


0.02 


0.11 


35461 


527 ± 43 


(-20.0, -19.0] 


-19.43 


0.02 


0.13 


41438 


470 ± 48 


(-20.5, -19.5] 


-19.90 


0.02 


0.16 


43600 


451 ± 44 


(-21.0, -20.0] 


-20.36 


0.04 


0.20 


36383 


413 ± 28 


(-21.5, -20.5] 


-20.79 


0.05 


0.20 


12853 


694 ± 134 


(-22.0, -21.0] 


-21.24 


0.06 


0.20 


2840 


993 ± 289 



Column (1) specifies the absolute magnitude limit for each volume limited sample, while the median magnitude is listed in column (2). 
The minimum and maximum redshifts of each sample axe listed in columns (3) and (4) , respectively, and the total number of 2dFGRS 
galaxies in each sample is listed in column (5). Note that only galaxies with a redshift completeness greater than 0.7 are selected. Finally, 
column (6) lists the PVDs in the 2dFGRS measured at k = Ih Mpc~^, plus the (cosmic variance) error determined from 8 MGRSs. 



D[kn(7i2{k)] 



1 + -^k^n^(Ti2{k) 



(42) 



and compute the redshift space power spectrum for each 
volume limited sample in Table 3 by Fourier transforming 
the corresponding 2PCF: 



2tv J dn J drprp^(rp,7r) cos(fc,r7r) 
JoikpTp) W{rp,Tr) 



(43) 



(JB04)S . Here Jo is the zeroth-order Bessel function, fc^r and 
kp are the wavenumbers perpendicular and parallel to the 
line-of-sight, and 



W{r 



oxp 



(44) 



is a Gaussian smoothing function (with smoothing scale 
S = 20h~^ Mpc) which is used to suppress the impact of 
fluctuations in ^(rp,7r) at large separations (see JB04 for 
details). Wo compute £,(rp,Tv) in equal logarithmic bins of 
Tp (Alnrp — 0.23) and in equal linear bins of tt (Att = 
1.0h~^ Mpc). The 7r-integral in (43) is performed over the 
interval —50h~^ Mpc < tt < 50/i~^ Mpc, while we integrate 
rp from 0.lh~^ Mpc to 50h~^ Mpc. 

Finally we determine the rcal-spacc power spectrum 
P{k) and the PVDs ui2{k) by modeling the measured 
P^^'(fc,/i) using eq.(41) with /3 = 0.45. Detailed tests in 
JB04 and L06 have shown that keeping /3 fixed at this 
value yields reliable results. The best-fit values for ayiik = 
l/iMpc~^) thus obtained are listed in Table 3 and are shown 
in Fig. 10 as black, open circles. The errorbars are obtained 
from 8 mock redshift surveys (see §5.2 below) and indicate 
the expected scatter due to cosmic variance. 

Comparison of these results with those of JB04 and LOG, 
reveals good mutual agreement at M^j — 5 log h — 19. For 
fainter galaxies, however, our analysis yields PVDs that are 
~ 150 kms~^ lower than those of JB04, with the results of 



Don't confuse the tt's in this equation: The first tt is the usual 
3.14159..., while the other tt's indicate the separation along the 
line-of-sight 



L06, which are based on the SDSS, roughly in between. Since 
our analysis is identical to that of JB04, these differences 

reflect the slightly different selection criteria. Whereas JB04 
used flux-limited samples with 0.02 < z < 0.25, we use 
volume-limited samples with the restrictions that 0.01 < 
z < 0.20. Another potential source of this difference is the 
relative sensitivity to the exact scale at which the PVDs 
are measured. As can be seen from Fig. 7 in JB04, their 
cri2(fc) for galaxies with —18.5 < Mi,j — 51og/i < —17.5 
reveals a pronounced, sharp peak of ~ 725 kms~^ at fc = 
l/iMpc~^. However, at slightly higher or lower k, the PVDs 
are ~ 550kms~^ in much better agreement with our results 
and those of L06. 



5.2 Comparison with CLF Models 

In order to predict PVDs from our CLF, we construct de- 
tailed MGRSs using the CLF and cosmological A'^-body sim- 
ulations (dark matter only). These MGRS are constructed 
to be directly comparable to the 2dFGRS, as described in 
detail in Appendix B. We analyze these MGRSs using ex- 
actly the same procedure (described above) as used for the 
2dFGRS data, so that the model-data comparison is as fair 
as possible. 

The solid (red) line in the right-hand panel of Fig. 10 
indicates the PVDs obtained from the MGRS constructed 
from the best-fit CLF (called WMAP3a in Table 2). For 
A4b J — 5 log ft ^ — 19.5 this model predicts PVDs that are 
in reasonable agreement with the data. For fainter samples, 
however, the PVDs are clearly too low compared to the 
2dFGRS. This is in qualitative agreement with JB04, even 
though our analysis is for the WMAP3 cosmology, while that 
of JB04 was for a WMAPl cosmology. 

In order to probe the uncertainties on ai2 due to the 
uncertainties on the CLF parameters, ideally one would con- 
struct a MGRS for each of the 2000 models in our MCMC. 
Unfortunately, the construction of MGRSs and their subse- 
quent analysis, is computationally too expensive, rendering 
this unpractical. Instead, we proceed as follows. Since the 
mass-to-light ratio of clusters and the satellite fractions are 
the two model aspects that most strongly impact on ai2, 



© 2000 RAS, MNRAS 000, 1-23 



18 van den Bosch et al. 



Y 1200 h 
o 

Oh 

3iooo 



■l-> 

CO 
CM 

b 



800 



600 ^ 



400 4 



200 



T 



* JB04 

■ Li et al (2006) 
O This paper 



L 



1 



i -4. 



I I 



T — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — |- 

O This paper 

WMAP3a (x==95.7) 

WMAP3b (x2 = B9.4) 

WMAP3C (x== 147.4) 

WMAPSd (x= = 31.8) 




-17 -18 -19 -20 -21 -17 -18 -19 -20 -21 
M^-51ogh M(,^-51ogh 

Figure 10. The PVD measured at fc = Ih Mpc~^ using the Fourier analysis described in §5.1 as function of the median magnitude of 
the volume limited sample used. Open circles in both panels indicate the results obtained in this paper from the 2dFGRS (see Table 3), 
with the errorbars indicating the cosmic variance as obtained from 8 MGRSs. In the left-hand panel, we compare these to the results 
obtained by JB04 (red, solid triangles) and L06 (blue, solid squares) from similar analyses of the 2dFGRS and SDSS, respectively. We 
have converted the SDSS r-band magnitudes to bj band magnitudes assuming bj — r = 0.9 (see L06). In the right-hand panel, we 
compare our 2dFGRS results to those obtained from four different WMAP3 CLF models, as indicated (cf. Table 2 and Fig. 11). For 
completeness, we have indicated the formal value for each of these models. See text for detailed discussion. 



we have searched the MCMC for two models that more or 
less bracket the 95 percent confidence limits of our CLF 
model. The parameters of these models, called WMAPSb 
and WMAPSc, are Usted in Table 2, while Fig. 11 shows the 
mass-to-hght ratios and satellite fractions of these models. 

The dashed (blue) and dotted (green) lines in Fig. 10 
show the PVDs of models WMAP3b and WMAP3c, respec- 
tively. Model WMAPSb predicts significantly higher satel- 
lite fractions and lower cluster mass-to-light ratios than 
the best- fit model (WMAPSa). Consequently, the PVDs for 
faint galaxies are much larger, bringing them in much better 
agreement with the PVDs obtained from the 2dFGRS. In or- 
der to quantify the comparison between different models, we 
compute the formal using the 'cosmic variance' errorbars 
obtained from our 8 mock redshift surveys. This yields 
values of 95.7 and 89.4 for models WMAPSa and WMAPSb, 
respectively. Despite the clear improvement at the faint end, 
the goodness-of-fit of model WMAPSb is only marginally 
better than for model WMAPSa. This owes almost entirely 
to the fact that model WMAPSb severely overpredicts the 
PVD for galaxies with Mi,j — 51og/i ~ —20.4: this single 
data point contributes 80.9 to the total x^! 

As expected, model WMAPSc predicts PVDs that are 
even lower than in the case of model WMAPSa, in clear 
disagreement with the data (x^ ~ 147.4). It does, however, 
accurately match the PVD at Mtj -5\ogh ~ -20 A. This 
suggests that perhaps a model with a high /sat at the faint 
end, and a low /sat at the high end, could fit the PVDs 
at all luminosities. Model WMAPSd, which we extracted 
from our MCMC, meets these criteria (see Fig. 11), and 
indeed yields PVDs that are in reasonable agreement with 



the data (x^ = SI. 8). It does dramatically underpredict the 
PVDs at the bright end, but since the corresponding (cosmic 
variance) errors are huge, the contribution to the total x^ is 
only modest. 



Thus we conclude that, within the WMAPS cosmology, 
one can find halo occupation models that can provide a rea- 
sonable, simultaneous fit to the luminosity dependence of 
the clustering strength and the luminosity dependence of 
the pairwise velocity dispersions. However, this does come 
at a price. The best-fit model (WMAPSd) is an extreme 
model within the MCMC; this is evident from both Table 2 
and Fig. 11, which show that model WMAPSd has model 
parameters, mass-to-light ratios and satellite fractions that 
differ substantially from the best-fit model. Furthermore, 
this model still does not fit the PVDs completely satisfac- 
tory. In particular it does not reveal a pronounced minimum 
in (712(1/), as observed. In fact, we have tested a number of 
additional models from our MCMC with similar /sat(i) as 
model WMAPSd, but none fair any better in this respect. 
We have also tested models with a 'random', rather than a 
'deterministic' sampling of Lccn, but this does not have a 
significant impact on the PVDs. Although we only tested a 
hand-full of models (selected in a strongly biased way), we 
therefore conclude that the detailed shape of the luminosity 
dependence of the PVDs remains a challenge for the halo 
occupation models. In the next section we discuss possible 
implications of these findings. 
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Figure 11. Some predictions of four models discussed in the text (lines) overplotted on the 68% and 95% confidence limits from the 
marginalized distributions of the WMAP3 MCMC. The left-hand panel shows the average mass-to-light ratio as function of halo mass. 
Contrary to Fig. 4, here we plot (M/L), not )M/Lig{, with (L) given by equation (2). The right-hand panel shows the satellite fraction as 
function of galaxy luminosity. Model WMAPSa corresponds to the best-fit model in the MCMC, while models WMAPSb and WMAPSc 
roughly outline the extrema of (M/ L)m and of /sat (L). Finally, model WMAPSd, whose mass-to-light ratio is almost identical to that of 
WMAPSb, has an extremely strong gradient in /sat(i). As shown in Fig. 10, this is the model that best fits the luminosity dependence 
of the PVDs. 



6 CONCLUSIONS 

Using the conditional luminosity function (CLF) formalism, 
and data from the 2dFGRS, we have constrained the uni- 
versal relation between light and mass. Using a Monte-Carlo 
Markov Chain we probe the complete parameter space of our 
models, and provide confidence limits on all derived quan- 
tities. With respect to our previous CLF studies we have 
made the following changes and improvements: 

• We have adopted a flat ACDM cosmology with param- 
eters advocated by the 3 year data release from the WMAP 
mission. 

• We have modeled the 2dFGRS data on its light cone. 

• We have taken the scale dependence of the halo bias 
into account. 

• We no longer impose that the mass-to-light ratios of 
haloes with M > 10^'^h~^ Mq are constant. 

The change in cosmology (lower Sim, lower erg and lower 
spectral index) causes a reduction in the mass-to-light ratios 
of dark matter haloes ranging from ~ 25 to ~ 45 percent, 
depending on the mass scale. As anticipated, this removes 
an important problem with previous CLF and HOD models 
which had a tendency to predict mass-to- light ratios for clus- 
ters that were too high (van den Bosch et al. 2003b, 2005b; 
Yang et al. 2004; Tinker et al. 2005; Vale & Ostriker 2006). 

Taking account of the light-cone modeling and the scale 
dependence of the halo bias only has a mild impact on our 
results, improving the accuracy of our models by ~ 5 to 10 
percent. We emphasize, though, that the impact of these 
effects can be much larger when using data out to higher 
redshift, or when using clustering data on smaller scales, 
compared to what we have used here. 

We have compared various predictions of our CLF 
model with results obtained from our 2dFGRS group cata- 



logue. We found excellent agreement for the average mass- 
to-light ratios, the luminosities of central galaxies as func- 
tion of halo mass, the faint-end slope of the CLF, the occupa- 
tion numbers in various luminosity bins, and the luminosity- 
gap statistic. The fact that these two completely different 
methods yield results in such good agreement, and for such 
a wide variety of statistics, is a major success for both the 
CLF formalism and for the halo-based group finder of Yang 
et al. (2005a). The CLF model also predicts that the satellite 
fraction decreases with increasing luminosity, in excellent 
agreement with the HOD analyses of Tinker et al. (2006b) 
and Cooray (2006), as well as with the constraints obtained 
by Mandelbaum et al. (2006) from a galaxy-galaxy lensing 
analysis of the SDSS. 

One outstanding issue regarding the mass-to-light ra- 
tios regards the actual slope of {M/L)m at the massive end 
(M > lO^^/i"^ Mq). While the group catalogue of Yang et 
al. (2005a) yields mass-to-light ratios that continue to in- 
crease roughly as (M/L) a/ cx K'P with 7 = 0.33 ± 0.05, an 
alternative group catalogue by Eke et al. (2004), also based 
on the 2dFGRS, predicts that 7 — > at the massive end. Un- 
fortunately, the clustering data used to constrain the CLF 
can not discriminate between these different values for 7. 
Although recent, independent studies seem to favor some- 
what intermediate values of 7 ~ 0.2 ± 0.08 (e.g., Popesso 
et al. 2005), the fact that two group catalogues constructed 
from the same data set yield such wildly different results, 
accentuates the need for more thorough investigations. 

We also presented a detailed description of the link be- 
tween the CLF and the more often used HOD models. In 
particular, we have shown how to compute the full halo- 
occupation distribution, P{N\M), from the CLF for any 
range in luminosities. In addition, we have compared the 
shape of (A'') m predicted by our CLF models with that typ- 
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ically assumed in HOD models. Although they agree qual- 
itatively, the HOD models typically adopt a zero-to-unity 
transition for {N)m which is much sharper than what we 
predict with our CLF. This implies that the CLF predicts 
probability distributions P{AI\Lccn) that are much broader 
than what is typically assumed in HOD models. The amount 
of scatter in P(M|Lcen) plays an important role in the in- 
terpretation of weak lensing measurements and of satellite 
kinematics. In More et al. (2006, in preparation) we present 
a strict lower limit on uiog m , obtained from satellite kine- 
matics, which rules out values for CTiogM lower than ~ 0.2. 

Finally, we have studied the luminosity dependence of 
the pairwise velocity dispersions, ai2, of 2dFGRS galaxies. 
Using ten volume limited samples, we obtain that g\2{L) 
reveals a local minimum at Mb j — 5 log ft ~ —20.4, in good 
qualitative agreement with Jing & Borner (2004) and Li 
et al. (2006). At the faint end, however, we obtain PVDs 
that are ~ 150 km s^^ lower than those of JB04. Since we 
used exactly the same analysis technique as JB04, these dif- 
ferences must reflect the different selection criteria. Using 
detailed mock galaxy redshift surveys we compared these 
(Ti2 (I/) with predictions from our CLF models. In agreement 
with previous studies (e.g.. Mo et al. 1993; Slosar et al. 2006; 
Tinker 2006c) we find that the PVDs are extremely sensi- 
tive to the satellite fractions, /sat(i), and to the (cluster) 
mass-to-light ratios. This is good news since the clustering 
data used to constrain the CLF leaves relatively large uncer- 
tainties regarding these quantities. Simultaneously match- 
ing ro(i) and ai2{L) therefore allows us to strongly tighten 
the constraints on parameter space. In particular, for the 
WMAP3 cosmology used here we find that ai2(L) requires 
models with relatively low mass-to-light ratios for clusters 
{(M/L)ci ^ 2157i (M/L)0) and with a satellite fraction that 
decreases from ~ 45 percent at Mi,j — 51og/i = —18.5 to 
~ 10 percent at Mbj - 5 log ft = -20.5. 

In terms of the likelihood distributions obtained from 
our MCMC, these values are typically >, 2a away from the 
median, indicating that our CLF model is not capable of 
accurately fitting ro{L) and ai2{L) simultaneously. In par- 
ticular, we were unable to find a CLF model in our MCMC 
that could reproduce the pronounced minirrmm in (Ji2{L) 
at Mbj — 5 log ft ~ —20.4. This suggests either (i) that we 
are dealing with the wrong cosmology, or (ii) that the CLF 
parameterization used here is not sufficiently general. Al- 
though we certainly can't rule out this latter option, we 
believe that option (i) is the more likely cause for this out- 
standing problem. This is motivated by some of our previous 
results. In Yang et al. (2004) we used our CLF formalism 
and the PVDs measured by Hawkins et al. (2003) to argue 
against the WMAPl cosmology and in favor of a cosmol- 
ogy with Qra ^ 0.3 and an ~ 0.75. The WMAP3 cosmology 
adopted here, has = 0.238 and as = 0.744. Lowering Qrn 
and/or erg reduces the abundance of meissive haloes, which 
in turn implies that lower cluster mass-to-light ratios are 
needed in order to explain the observed PVDs. The fact that 
the WMAP3 cosmology studied here requires relatively low 
values for (M/L)ci, while our WMAPl studies required rel- 
atively high cluster mass-to-light ratios, therefore suggests 
a cosmology with values for and/or as intermediate be- 
tween those of the WMAPl and WMAP3 cosmologies. We 
leave it for future studies to see whether indeed such a cos- 
mology can yield a CLF that can simultaneously match the 



ro{L) and cti2(L) with realistic model parameters. As a fi- 
nal note, however, we wish to emphasize that the combined 
constraints from ro{L) and ai2{L) are extremely tight, thus 
offering great potential to constrain both cosmological pa- 
rameters and halo occupation statistics. The CLF formalism 
presented here is ideally suited for such a task. 
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APPENDIX A: THE 2DFGRS GROUP 
CATALOGUE 

Throughout this paper, we compare various CLF predic- 
tions with results obtained from our 2dFGRS group cata- 
logue. The construction of this catalogue is based on a halo- 
based group finder, which is described in Yang et al. (2005a, 
hereafter YMBJ). This group finder is optimized to group 
together those galaxies that reside in the same dark mat- 
ter halo, and has been tested in great detail against mock 
galaxy rodshift surveys (YMBJ; Yang et al. 2005b; Wein- 
mann et al. 2006a,b) 

Contrary to most other studies, we do not determine 
the group masses from the velocity dispersion of the group 
members. Instead, we estimate the group masses from the 
group luminosity Lis, defined as the total luminosity of all 
group members brighter than Mt,j — 5 log h = —18. Detailed 
tests have shown that this method is significantly more ac- 
curate than using the velocity dispersion of group members 
(see Appendix B of Weinmann et al. 2006a). For distant 
groups, in which not all galaxies above this magnitude limit 
are brighter than the flux limit of the survey, we correct Lis 
for the missing members using an empirical self-calibration 
based on the groups that are sufficiently nearby (see YMBJ 
for details). Finally, to convert from Lis to a halo mass M, 
we make the assumption that there is a one-to-one relation 
between Lis and M. For each group we determine the num- 
ber density of all groups brighter (in terms of Lis) than the 
group in consideration, and we then use the halo mass func- 
tion for the WMAP3 cosmology to find the value of M for 
which the more massive haloes have the same number den- 
sity. Note that this has the disadvantage that the group mass 
is cosmology dependent. However, it can easily be converted 
to any other cosmology, using the relation 

poo POO 

/ n{M')dM' = I n{M')dM' (Al) 

J M J M 

Here M and n(M) are the mass and halo mass function in 
the WMAP3 cosmology, and M and n{M) are the corre- 
sponding values in the other cosmology . 

An obvious shortcoming of this method is that the true 
relation between Lis and M contains some scatter, which 
thus results in errors in the inferred group masses. However, 
detailed tests with mock galaxy redshift surveys have shown 
that this method nevertheless allows for a very accurate re- 
covery of average halo occupation statistics. In particular, 
the group finder yields average halo occupation numbers 
and average mass-to-light ratios that are in excellent agree- 
ment with the input values (Yang et al. 2005b; Weinmann 
et al. 2006b). 

Application of this group finder to the 2dFGRS, yields a 
catalogue consisting of 77, 708 groups, which in total contain 
104, 912 galaxies. Among these, 7251 are binaries, 2343 are 
triplets, and 2502 are systems with four members or more. 
The vast majority of the groups (66, 612 systems) in our cat- 
alogue, however, consist of only a single member. Note that 
some faint galaxies are not assigned to any group, because 
it is difficult to decide whether they are the satellite galaxies 



Table Al. Parameters of Galaxy Groups in 2dFGRS 



logA^ log(Leon> log(Li8> (A^is) (Nig) {N20) 

/i-^Mq /i-^Lq h-'^LQ 

11.56 9.57 9.63 0.535 

11.88 9.77 9.86 1.238 1.015 

12.20 10.02 10.13 1.416 1.203 0.032 

12.50 10.19 10.33 1.879 1.441 0.914 

12.79 10.32 10.53 2.715 1.904 1.004 
13.06 10.41 10.69 3.795 2.603 1.385 
13.31 10.47 10.85 5.311 3.722 1.845 
13.55 10.53 11.00 8.580 5.258 2.282 
13.77 10.58 11.15 14.43 7.492 2.947 
13.96 10.62 11.28 21.55 10.74 3.801 
14.14 10.67 11.41 28.65 13.95 5.094 
14.30 10.75 11.52 43.72 19.87 5.903 
14.44 10.80 11.62 70.00 24.22 8.323 
14.58 10.85 11.69 73.01 28.08 6.852 
14.69 - 11.77 112.5 25.12 10.08 

14.80 - 11.82 - 47.65 7.219 

Column (1), (2) and (3) list the group mass, the average luminos- 
ity of the central group galaxy, and the average, total luminosity 
of all group galaxies with Mj^ — 51ogh < —18. Columns (4), (5), 
and (6) list the average number of galaxies, per group, brighter 
than M[, J — 51og/i = —18, —19 and —20, respectively. Note that 
the group masses are only valid for the WMAP3 cosmology used 
here. However, it is straight forward to convert these numbers to 
any other cosmology. 

of larger systems, or the central galaxies of small haloes. Ta- 
ble Al lists a number of average properties of these groups, 
as function of the assigned group mass. These properties 
have been used in this paper for comparison with our CLF 
predictions. 



APPENDIX B: MOCK GALAXY REDSHIFT 
SURVEYS 

We construct MGRSs by populating dark matter haloes with 
galaxies of different luminosities. The distribution of dark 
matter haloes is obtained from two large A''-body simula- 
tions of AT = 512^ dark matter particles each. These simula- 
tions have been carried with PKDGRAV, a tree-code written 
by Joachim Stadel and Thomas Quinn (Stadel 2001). Each 
simulation evolves the distribution of the dark matter in 
the WMAP3 ACDM cosmology {Qm = 0.238, = 0.762, 
= 0.042, h = 0.73, as = 0.75, Us = 0.951). The ini- 
tial conditions are generated with the GRAFIC2 package 
(Bertschinger 2001), which also computes the transfer func- 
tion as described in Ma & Bertschinger (1995). The two 
simulations have periodic boundary conditions and box sizes 
of lOO/i"^ Mpc and 300/i~^ Mpc, respectively. The particle 
masses are 4.92 x 10** /i"^Mq and 1.33 x 10^° h'^MQ for 
the small and large box simulations, respectively. In what 
follows we refer to the simulations with Lbox = WOh~^ Mpc 
and Lbox = 300/i~^ Mpc as Lioo and L300, respectively. 

We follow Yang et al. (2004) and replicate the L300 box 
on a 4 X 4 X 4 grid. The central 2x2x2 boxes, are replaced 
by a stack of 6 x 6 x 6 Lioo boxes, and the virtual observer 
is placed at the center (see Fig. 11 in Yang et al. 2004). This 
stacking geometry circumvents incompleteness problems in 
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the mock survey due to insufficient mass resolution of the 
L300 simulations, and allows us to reach the desired depth 
of = 0.20 in all directions. 

Dark matter haloes are identified using the standard 
FOF algorithm with a linking length of 0.2 times the moan 
inter-particle separation. Unbound haloes and haloes with 
less than 10 particles are removed from the sample. The re- 
sulting halo mass functions are in excellent agreement with 
the analytical halo mass function of Sheth, Mo & Tormen 
(2001). Wc populate the dark matter haloes with galaxies of 
different luminosity using our CLF. Because of the mass res- 
olution of the simulations and because of the completeness 
limit of the 2dFGRS, wc adopt a minimum galaxy luminos- 
ity of Lmin = 10 h^^ Lq. Thc halo occupation statistics of 
these galaxies follow from the CLF as described in §4. Lu- 
minosities are drawn using either the 'deterministic' or the 
'random' sampling method described in §2.4, whereby we al- 
ways assume that the central galaxy is the brightest galaxy 
in its halo. 

The positions and velocities of the galaxies with re- 
spect to the halo center-of-mass are drawn assuming that 
the brightest galaxy in each halo resides at rest at the center. 
The satellite galaxies follow a number density distribution 
that is identical to that of the dark matter particles, and 
are assumed to be in isotropic equilibrium within the dark 
matter potential. To construct MGRSs we use the same se- 
lection criteria and observational biases as in the 2dFGRS, 
making detailed use of the survey masks provided by the 
2dFGRS team (Colless et al. 2001; Norberg et al. 2002). 
We also mimic fiber collisions and image blending as de- 
scribed in detail in van den Bosch et al. (2005b). The final 
MGRSs accurately match the clustering properties, the ap- 
parent magnitude distribution and the redsliift distribution 
of the 2dFGRS, and mimic all the various incompleteness ef- 
fects, allowing for a direct, one-to-one comparison with the 
true 2dFGRS. 
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